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Abstract 

We study how a mobile robot can piecemeal learn an unknown environment. The 
robot's goal is to learn a complete map of its environment, while satisfying the con- 
straint that it must return every so often to its starting position s (for refueling, say). 
The environment is modelled as an arbitrary, undirected graph, which is initially un- 
known to the robot. We assume that the robot can distinguish vertices and edges 
which it has already explored. We present a surprisingly efficient algorithm for piece- 
meal learning an unknown undirected graph G = (V, E) in which the robot explores 
every vertex and edge in G by traversing at most 0(E + V l+0 ^) edges. This nearly 
linear algorithm improves on the best previous algorithm, in which the robot traverses 
at most 0(E + V 2 ) edges. 

We also address the related problem of searching a graph for a particular distin- 
guished location or treasure. If this treasure is likely to be near s, then the robot 
should explore in a breadth-first manner. We show how the robot can explore while 
never being much further than 8 away from s, where 8 is the shortest path distance 
from s of the unvisited vertex nearest to s. We show that if the robot is never more 
than 8 away from s (as in traditional BFS), then there are graphs for which the robot 
traverses fl(E 2 ) edges. In the algorithm we give, the robot traverses 0(E + V l+0 ^) 
edges, and maintains the following property: if 8 is the distance from s to the unvisited 
vertex nearest to s, then the robot is never further than 8 + o(8) away from s. 



*We gratefully acknowledge support from NSF grant CCR-9310888, ARO grant DAAL03-86-K-0171, NSF 
grant 9217041-ASC, Air Force Contract TNDGAFOSR-86-0078, ARPA/Army contract DABT63-93-C-0038, 
NSF contract 9114440-CCR, DARPA contract N00014-J-92-1799, the Siemens Corporation, and a special 
grant from IBM. The authors can be reached at baruchQblaze . cs . jhu. edu, margritfitheory . lcs .mit . edu, 
rivestfitheory . lcs . mit . edu, and monafitheory . lcs . mit . edu. 

lAlso at Johns Hopkins University, Baltimore, MD 21218. 



1 Introduction 

Environment learning and algorithmic motion planning for robots have recently become ac- 
tive research areas. The goal is to find efficient algorithms for a robot to learn about or 
navigate in its environment. Such algorithms are now useful in practice: there are working 
meal delivery robots in hospitals [15], and vehicles that navigate autonomously on high- 
ways [4]. More formal theoretical approaches to these problems have also been studied 
extensively (e.g., [20, 10, 22, 5]). 

We study the problem of piecemeal learning of an unknown environment [8]. The robot's 
goal is to learn a complete map of its environment while satisfying the piecemeal constraint 
that learning must be done a "piece at a time," with the robot returning to the starting 
point s after each learning phase. Why might mobile robot exploration be done piecemeal? 
Robots may explore environments that are too risky or costly for humans: the inside of a 
volcano (e.g., CMU's Dante II robot) or the surface of Mars. Or, the robot's hardware may 
be too expensive or fragile to stay long in dangerous conditions. Thus, it may be best to 
organize the learning into phases, allowing the robot to return to s before it breaks down 
or runs out of power. At the start position s, the robot can cool off, recharge, or drop off 
samples collected. 

Approaches to modelling a robot's environment come from graph theory, computational 
geometry, on-line algorithms, and the theory of finite automata. The model used here 
was introduced by Betke, Rivest, and Singh [8]. The robot's task is to learn an unknown 
environment modeled as an undirected graph G = ( V, E) in a piecemeal manner. The robot's 
efficiency (or running time) is measured in terms of the number of edges traversed. The main 
difficulty in our work lies in designing efficient, but analyzable, robot exploration algorithms. 
We give an almost linear time algorithm: it achieves 0(E + V 1+0 ^ 1 ') running time. The most 
efficient previously known algorithm has 0(E + V 2 ) running time. We also give a simpler 
but less efficient algorithm that runs in 0(E + V 1 ' 5 ) time. 

A robot can explore grid-graphs with rectangular obstacles in a piecemeal manner in 
linear time, if the robot is given a bound on the number of edges it may traverse in each 
learning phase (Betke, Rivest, and Singh [8]). We extend these results to show that the 
robot can learn any undirected graph piecemeal in almost linear time. It is open whether 
arbitrary, undirected graphs can be learned piecemeal in linear time. 

The piecemeal constraint is most naturally satisfied by requiring the robot to explore in 
a near breadth-hrst manner, so that it is never much further away from s than necessary 
to visit any unexplored vertex. In this manner, returns to s are efficient. Breadth-hrst 
search (BFS) on unknown graphs is also an important problem in its own right, with many 
applications. We consider one such application, treasure hunting, where the goal is to find 
a treasure (or a lost child, or a particular landmark) that is believed to be near s. If the 
robot knows that the treasure is close to its goal location, it should explore in a breadth-hrst 
manner from its current position. 

BFS is a classic technique for searching graphs [19, 18, 11]. However, standard BFS is 



efficient only when the robot can efficiently switch or "teleport" from expanding one vertex to 
expanding another. In contrast, our model assumes a more natural scenario where the robot 
must physically move from one vertex to the next. We change the classical BFS model to a 
more difficult teleport-free exploration model, and give efficient approximate BFS algorithms: 
algorithms that satisfy the "approximate BFS constraint" (the robot does not move much 
further away from s than the distance from s to the unvisited vertex nearest to s). Our 
hrst teleport-free BFS algorithms never visit a vertex more than twice as far from s as 
the nearest unvisited vertex is from s. Our final teleport-free BFS algorithm satisfies the 
stronger condition that if the closest unvisited vertex to s is distance 8 away, the robot is 
never more than 8 + 0(8) away from s. For the treasure hunting problem, if the treasure is 
at a vertex that has shortest path distance 8j away from s, then the robot traverses at most 
0(E + y 1+0 ( 1 )) edges, where E and V are the number of edges and vertices within radius 
A = St + o(8t) from s. In contrast, we give a simple example to show that if the robot 
exactly satisfies the traditional BFS constraint (i.e., it cannot move further away from s 
than the unvisited vertex nearest to s), then it may traverse up to 0(E 2 ) edges. Our final 
treasure hunting algorithm is also a solution to the piecemeal learning problem, but it is 
more complicated than our fastest piecemeal learning algorithm. 

Previous work 

Papadimitriou and Yanakakis [20] developed one of the hrst models for exploring unknown 
environments. They show how to find a shortest path in an unknown, undirected graph. 
Deng and Papadimitriou [13] and Betke [6] address the problem of learning an unknown di- 
rected graph. Bender and Slonim [5] show how two cooperating robots can learn a directed 
graph. Rivest and Schapire [22] model the robot's unknown environment by a determin- 
istic finite automaton. They describe algorithms that efficiently infer the structure of the 
automaton through experimentation. Deng, Kameda, and Papadimitriou [12] consider the 
how to learn the interior of a two-dimensional room. Blum, Raghavan, and Schieber [10] 
consider a robot navigating in an unknown two-dimensional geometric terrain with convex 
obstacles. Bar-Eli, Berman, Fiat, and Yan [3] give an efficient algorithm for reaching the 
center of a two-dimensional room with obstacles. Betke [7] and Kleinberg [17] address the 
problem of localizing a mobile robot in its environment. Blum and Chalasani [9] consider 
the problem of finding a "k-trip" shortest path in the environment. There are many other 
related papers in the literature [16, 14]. Rao, Kareti, Shi, and Iyengar [21] give a survey of 
work on "robot navigation in unknown terrains." 

Our techniques are inspired by the work of Awerbuch and Gallager [1, 2]. We observe 
that our learning model bears some similarity to the asynchronous distributed model. This 
similarity is surprising and has not been explored in the past. 



2 Model and statement of main results 

We model the robot's environment as a finite undirected graph G = (V } E) with a distin- 
guished start vertex s. The graph is initially unknown to the robot. Each vertex in the graph 
represents an accessible location, and each edge represents a connection between adjacent 
locations. During each step of exploration, the robot moves from its current location to an 
adjacent location; it is not allowed to "teleport" from one vertex to another distant vertex. 
The robot can recognize previously visited vertices. The robot can distinguish the edges 
incident to its current vertex and it knows which edges it has traversed already, but it has 
no vision or long-range sensors. The robot incurs a cost only for traversing an edge; thinking 
and path planning (computation) are free. 

We consider two closely related constraints on the exploration: the "piecemeal constraint" 
to model learning unknown environments in phases, and the "approximate BFS constraint" 
to model exploring an unknown graph in order to find a treasure. 

2.1 Piecemeal Learning 

The robot's goal in piecemeal learning is to explore its entire (unknown) environment, while 
satisfying the piecemeal constraint that it must return every so often to its starting point. To 
assure that the learner can reach any vertex in the graph, do some exploration, and then get 
back to the start vertex, we assume the robot may traverse (2-\-a)R edges in one exploration 
phase, where a > is some constant and R is the radius of the graph (the maximum of all 
shortest path distances between s and any vertex in G). 

We say an exploration efficiently interruptible if the robot always knows a path of explored 
edges of length at most R back to s. 

Theorem 1 An efficiently interruptible algorithm for exploring an unknown graph G = 
( V, E) with n vertices and m edges that takes time T(n, m) can be transformed into a piece- 
meal learning algorithm that takes time 0(T(n,m)). 

The proof of this theorem is similar to one shown by Betke, Rivest, and Singh in a 
previous paper [8]. 

All the algorithms we present in this paper are efficiently interruptible, and thus give 
efficient piecemeal learning algorithms for undirected graphs. Our main theorem is: 

Theorem 2 Piecemeal learning of a general undirected graph G = ( V, E) can be done in 
time 0{E + V 1+0 ^). 

Proof sketch: In the RECURSIVE STRIP-ALGORITHM, given in Section 5, the robot always 
knows a path from its current location back to the source vertex of length at most the radius 
of the graph. We discuss the running time of this algorithm in Section 5. The bound is 
0(E + y2°^ V los1/loslo§1/ )). By Theorem 1, this algorithm can be interrupted efficiently to 
give a piecemeal learning algorithm. □ 



2.2 Treasure Hunting 

If the robot's goal is to explore an unknown environment in order to find a treasure that is 
believed to be the near s, then the robot should explore in a breadth-hrst manner. 

In traditional BFS, the robot may not move further away from the source than the 
unvisited vertex nearest to the source. At any given time in the algorithm, let A denote 
the (shortest-path) distance from s to the vertex the robot is visiting, and let 8 denote the 
(shortest-path) distance from s to the vertex nearest to s that is as yet unvisited. With 
traditional breadth-hrst search we have A < 8 (actually A = 8) at all times. With teleport- 
free exploration, it is generally impossible to maintain A < 8 without a great loss of efficiency: 

Lemma 1 A robot which maintains A < 8 (such as a traditional BFS) may traverse ft(E 2 ) 
edges. 

Proof: Consider a graph with vertices {— n, — ra + 1, . . . , — 1, 0, 1, 2, . . . , n — 1, n}, where s = 
and edges connect consecutive integers. To achieve A < 8, a teleport-free BFS algorithm 
would run in quadratic time, traveling back and forth from I to —I to —2 to 2 to 3 ... . □ 

Given this lower bound, we solve the treasure hunting problem efficiently while maintain- 
ing the "approximate BFS constraint." Our initial algorithms STRIP-EXPLORE, ITERATIVE- 
STRIP, and RECURSIVE-STRIP, described in Sections 3, 4, and 5, maintain A < 28: the robot 
is never more than twice as far from s as is the nearest unvisited vertex. Our final algo- 
rithm TREASURE-SEARCH, given in Section 6, satisfies the stronger condition A = 8 + o(8). 
Note that this algorithm is also efficiently interruptible and thus can also be used to solve 
the piecemeal learning problem; however, it is more complicated. Our main theorem about 
treasure hunting is: 

Theorem 3 Given an unknown graph with a treasure at distance 8j from s, a robot can 
find the treasure while getting at most distance A = 8j + o(8t) away from the source vertex, 
with an algorithm of running time 0(E + V 1+0 ^ 1 '), where E and V are the total number of 
distinct edges and vertices within radius A from the source. 

Proof sketch: The algorithm TREASURE-SEARCH given in Section 6 satisfies the theorem. 
We discuss the properties of this algorithm in Section 6. □ 

3 An exploration algorithm: Strip-Explore 

This section describes an efficiently interruptible algorithm for undirected graphs with run- 
ning time 0(E + V 1 ' 5 ). It is based on breadth-hrst search. 

A layer in a BFS tree consists of vertices that have the same shortest path distance to 
the start vertex. A frontier vertex is a vertex that is incident to unexplored edges. A frontier 
vertex is expanded when the robot has traversed all the unexplored edges incident to it. 



The traditional BFS algorithm expands frontier vertices layer by layer. In the teleport- 
free model, this algorithm runs in time 0(E + rV), since expanding all the vertices takes 
time 0(E) , and visiting all the frontier vertices on layer i can be performed with a depth- 
first search of layers I . . . i in time 0(V), and there are at most r layers. The procedure 
LOCAL-BFS describes a version of the traditional BFS procedure that has been modified for 
our teleport-free BFS model in two respects. First, the robot does not relocate to frontier 
vertices that have no unexplored edges. Second, it only explores vertices within a given 
distance-bound L of the given source vertex s. (The hrst modification, while seemingly 
straightforward, is essential for our analysis of our more complex algorithms that use LOCAL- 
BFS as a subroutine at various source vertices). A procedure call of the form LOCAL- 
BFS(s,r), where s is the source vertex of the graph and r is its radius, would cause the 
robot to explore the entire graph. 

Awerbuch and Gallager [I, 2] give a distributed BFS algorithm which partitions the 
network in strips, where each strip is a group of L consecutive layers. (Here L is a parameter 
to be chosen.) All vertices in strip i — 1 are expanded before any vertices in strip i are 
expanded. Their algorithms use as a subroutine breadth-hrst type searches with distance L. 

Local-BFS(s,Z) 

1 for i = to L — 1 do 

2 let current-verts = all vertices at distance i from s 

3 for each u G current-verts do 

4 if u has any incident unexplored edges 

5 then 

6 Relocate to u 

7 Traverse each unexplored edge incident to u 

8 relocate to s 

Our algorithm, STRIP-EXPLORE, uses the idea of search in strips in a new way. See 
Figure 1. The robot explores the graph in strips of width L. First the robot does LOCAL- 
BFS(s,iv) to explore the hrst strip. It then explores the second strip as follows. Suppose 
there are k frontier vertices Vi 7 v 2} . . . , v^ in layer L\ each such vertex is a source vertex for 
exploring the second strip. A naive way for exploring the second strip is for the robot for 
each z, to relocate to Vi, and then find all vertices that are within distance L of V{ by doing a 
BFS of distance-bound L from V{ within the second strip. The robot thus traverses a forest 
of k BFS trees of depth L, completely exploring the second strip. The robot then has a map 
of the BFS tree of depth L for the hrst strip and a map of the BFS forest for the second strip, 
enabling it to create a BFS tree of depth 2L for the hrst two strips. The robot continues, 
strip by strip, until the entire graph is explored. 

The naive algorithm described above is inefficient, due to the to overlap between the trees 
in the forest at a given level, causing portions of each strip to be repeatedly re-explored. 

The STRIP-EXPLORE presented below solves this problem by using the LOCAL-BFS 
procedure as the basic subroutine, instead of using a naive BFS. (See Figure 2.) Using this 



algorithm, the explorer searches in a breadth-hrst manner, but ignores previously explored 
territory. The only time the robot traverses edges which have been previously explored is 
when moving to a frontier vertex it is about to expand. This results in retraversal of some 
edges in previously explored territory, but not as many as in the naive algorithm. 



Strip-Explore(s, L, r) 

1 numstrips = \r/L~\ 

2 sources = {s} 

3 for i = 1 to numstrips do 

4 for each u G sources do s 

5 Relocate to u 

6 Local-BFS(u,Z) 

7 sources = all frontier vertices 




depth r 



strip of depth L 



Figure 1: The Strip-Explore. In the naive algorithm, the shaded areas are retraversed com- 
pletely. In the strip algorithm, the shaded areas are passed through more than once only if necessary 
to get to frontier vertices. 



s =* f, 



Figure 2. Contrasting BFS and Local-BFS: Consider doing a BFS of depth 
5 from si, followed by a BFS of depth 5 from «2. (The depth of the strip is 
L = 5.) The BFS from S2 revisits vertices a, b, c, d, e. On the other hand, if 
the BFS from si is followed by a Local-BFS from S2, then it only revisits 
d, c, e. After collision edge (/, d) is found, vertex e is a frontier vertex that 
needs to be expanded. 



Theorem 4 STRIP-EXPLORE 



runs in 



time 0(E + V 



1.5\ 



Proof: First we count edge traversals for relocating between source vertices for a given strip. 
For these relocations, the robot can mentally construct a tree in the known graph connecting 
these vertices, and then move between source vertices by doing a depth-first traversal of this 
tree. Thus the number of edge traversals due to relocations between source vertices for this 
strip is at most 2V . Since there are \r/L~\ strips, the total number of edge traversals due to 
relocations between source vertices is at most 2rV/ L + 2r. 

Now we count edge traversals for repeatedly executing the LOCAL-BFS algorithm. First 
for the robot to expand all vertices and explore all edges it traverses 2E edges. Next, each 
time line 8 of procedure LOCAL-BFS is called, at most L edges are traversed. To account for 
relocations in line 6 of procedure LOCAL-BFS, we use the following scheme for "charging" 
edge traversals. Say the robot is within a call of the LOCAL-BFS algorithm. It has just 
expanded a vertex u and will now relocate to a vertex v to expand it. Vertex v is charged 
for the edges traversed to relocate from u to v. (We are only considering relocations within 
the same call of the LOCAL-BFS algorithm; relocations between calls of the LOCAL-BFS 
algorithm were considered above.) Source vertices are not charged anything. Moreover, the 



robot can always relocate from u to v by going from u to the source vertex of the current local 
BFS, and then to v, traversing at most 2L edges. Thus, each vertex is charged at most 2L 
when it is expanded. LOCAL-BFS never relocates to a vertex v unless it can expand vertex 
v (i.e., unless v is adjacent to unexplored edges). Thus, all relocations are charged to the 
expansion of some vertex, and the total number of edge traversals due to relocation is at 
most 2LV . 

Thus the total number of edge traversals is at most 2rV/L + 2r + 3LV + 2E } which is 



0(rV/L + r + LV). When L is chosen to be ^fr-^ this gives 0(E + V ) edge traversals. □ 

It is easy to show for STRIP- EXPLORE, and the generalizations of it given in later sections, 
that A < 26 at all times; the worst case is when the treasure is at the beginning of the second 
strip. 



4 Iterative strip algorithm 

In this section, we describe ITERATIVE- STRIP, an algorithm similar to the STRIP-BFS 
algorithm. It is an efficiently interruptible algorithm for undirected graphs inspired by 
Awerbuch and Gallager's [1] distributed iterative BFS algorithm. Although its running time 
of Ody 1 ' 5 + E) log V) is worse than the running time of STRIP-BFS, its recursive version 
(described in the next section) is more efficient than STRIP-BFS. (It is not clear how to 
recursively implement STRIP-BFS as efficiently, because the search trees in a strip are not 
disjoint.) 

Following the ITERATIVE- STRIP, the robot grows a global BFS tree with root s strip by 
strip, similar to STRIP-BFS. Unlike STRIP-BFS, each strip is processed several times before 
it has correctly deepened the BFS tree by -^/r. We next explain the algorithm's behavior 
on a typical strip by describing how a strip is processed for the hrst time, and then for the 
remaining iterations. 




depth D 



Figure 3: The iterative strip algorithm after the first iteration on the fourth strip. Two connected 
components Ci,c 2 have been explored. The collision edges e x and e 2 connect the first three ap- 
proximate BFS trees. The dashed line shows how source vertices Si,s 2 ,s 3 connect within the strip. 
There are three active frontier vertices with depth less than D + yfr. 



In the first iteration, a strip is explored much as in the STRIP-BFS. The robot explores 
a tree of depth ^fr from each source vertex, by exploring in breadth-hrst manner from each 
source vertex, without re-exploring previous trees. Whenever the robot finds a collision edge 
connecting the current tree to another tree in the same strip, it does not penetrate into the 
other tree. Unlike STRIP-BFS, the robot does not traverse explored edges to get to the active 
frontier vertices on other trees. Therefore, after the first iteration, the trees explored are 
approximate BFS trees, which may have frontier vertices with depth less than ^Jr from some 
source vertex. These vertices become active frontier vertices for the next iteration. Thus, 
the current strip may not yet extend the global BFS tree by depth y/r, so more iterations 
are needed until all frontier vertices are inactive and the global BFS tree is extended by 
depth \fr (see Figure 3). 

In the second iteration (see Figure 4), the robot uses the property that two trees connected 
by a collision edge form a connected component within the strip. (The graph to be explored is 
connected, and thus forms one connected component; but we refer to connected components 
of the explored portion of the graph contained within the strip.) The robot does not have 
to traverse any edges outside of the current strip to relocate between these active frontier 
vertices in the same connected component. In the second and later iterations, the robot 
works on one connected component at a time. 



finished strip 




depth D 



depth D + ■fx 



Figure 4: The iterative strip algorithm after the second iteration. Now the circled vertices which 
were active frontier vertices at the beginning of the iteration are expanded. One of the expansions 
resulted in a collision edge. Now the strip consists of only one connected component (shaded area). 
There are six frontier vertices which become source vertices of the next strip. All frontier vertices 
have depth D + y/r. 



The robot explores active frontier vertices in one connected component as follows. He 
computes (mentally) a spanning tree of the vertices in the current strip. This spanning tree 
lies within the strip. Let d be the least depth of any active frontier vertex in the component 
from a source vertex. He visits the vertices in the strip in an order determined by a DFS of 
the spanning tree. As it visits active frontier vertices of depth d, it expands them. He then 
recomputes the spanning tree (since the component may now have new vertices) and again 
traverses the tree, expanding vertices of the appropriate next depth d 1 . Traversing a collision 
edge does not add the new vertex to the tree, since this vertex has been explored before. 
This process continues (at most \fr times) until no active frontier vertex in the connected 
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Iterative- STRip(s,r) 

1 for i = 1 to y/r do 

2 for each source vertex u in strip i do 

3 Relocate to u 

4 BFS from u to depth y 7 ?, but do not enter previously explored territory 

5 while there are any active connected components iterate 

6 for each active connected component c do 

7 repeat 

8 let Vi, t> 2 , t>3, . . . be active frontier vertices exclusively in c 

with smallest depth among active frontier vertices in c 

9 relocate to each of Vi, v 2 , v 3 , . . ., and expand 

10 until no more active frontier vertices exclusively in c 

11 determine new and active connected components 



component has distance less than \fr from some source vertex in the component. 

The robot processes each connected component in turn, as described above. Then the 
next iteration starts in which it combines the components now connected by new collision 
edges and explores the new active frontier vertices in those components. Lemma 2 states 
that at most log V iterations cause all frontier vertices to not be active any more; then the 
only active frontier vertices are the new sources of the next strip. 

Lemma 2 At most log V iterations per strip are needed to explore a strip and extend the 
global BFS tree by depth y/r. 



Theorem 5 ITERATIVE- STRIP runs in time 0((E + V^-^log V). 

Proof sketches are included in the Appendix. □ 

5 The recursive strip algorithm 

In this section, we give an efficiently interruptible algorithm RECURSIVE- STRIP, which gives 
a piecemeal learning algorithm which traverses at most 0(E + y 1+0 ( 1 )) edges. RECURSIVE- 
STRIP is the recursive version of ITERATIVE- STRIP; it provides a recursive structure which 
coordinates the exploration of strips, of approximate BFS trees, and of connected components 
in a different manner. The robot still, however, builds a (global) BFS tree from start vertex 
s strip by strip. The robot expands vertices at the bottom level of recursion. 

In this algorithm, the depth of each strip depends on the level of recursion (see Figure 5). 
If there are k levels of recursion, then the algorithm starts at the top level by splitting the 
search of G into V/d^-i strips of depth dk-\. Each of these is split into dk-i/dk-2 searches 
of strips of depth (4_2, etc. We have V = dk > <4-i > • • • > d\ > d = 1. 

9 



Recursive-Strip (sources, depth, T) 

1 if depth = 1 

2 then let Vi, v 2 , ■ ■ ■ , v k be the depth-first ordering of sources in spanning tree of sources 

3 for i = 1 to k do 

4 relocate to v, 

5 if v i has adjacent unexplored edges then traverse v^s incident edges 

6 T = TU {newly discovered vertices} 

7 return 

8 else determine next depth 

9 number-of- strips <— depth/next-depth 

10 for i = 1 to number-of- strips do 

11 determine set of source vertices 

12 for j = 1 to number-of-iterations do 

13 partition vertices in T into maximal sets T l7 T 2 , ■ ■ ■ ,T k such that 

vertices in each T c are known to be connected within strip i 

14 for each T c in suitable order do 

15 let S c be the source vertices in T c 

16 Relocate to some source s £ S c 

17 Recursive-Strip(5 c , next-depth, T c ) 

18 T = T U T c 

19 Relocate to some s G sources 

20 return 



depth L' 



depth r 




unexplored 
territory 



strip of depth L 



Figure 5: The recursive strip algorithm processing an approximate BFS tree from source vertex s 2 
to depth d k _i = L. Recursive calls within the tree are of depth d k _ 2 = L' . 
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Each recursive call of the algorithm is passed a set of source vertices sources, the depth 
to which it must search, and a set T of all vertices in the strip already known to be less 
than distance depth from one of the sources. The algorithm traverses all edges and visits 
all vertices within distance depth of the sources that have not yet been processed by other 
recursive calls at this level. RECURSIVE-STRIP({s}, r, {s}) is called to explore the entire 
graph. 

At recursion level z, the algorithm divides the search into strips and processes each strip in 
turn, as follows. Suppose the strip has / source vertices v^, . . . , vi. The strip is processed in at 
most log / = O(log V) iterations. In each iteration, the algorithm partitions T into maximal 
sets Ti, T 2 , . . . , Tk such that each set known to be connected within the strip. Let S c denote 
the source vertices in T c . A DFS of the spanning tree of the vertices T gives an order for the 
source vertices in Si, S2, . . . , Sk' : this spanning tree is used for efficient relocations between 
these source vertices. Note that all source vertices are known to be connected through the 
spanning tree of the vertices in T, but they might not be connected within the substrips. 
Since relocations between the vertices in S c in the next level of recursion use a spanning tree 
of T c , for efficiency the vertices of T c must be connected within the strip. After partitioning 
the vertices into connected components within the strip, for each connected component T c , 
the algorithm relocates (along a spanning tree) to some arbitrary source vertex in S c . It then 
calls the algorithm recursively with S c , the depth of the strip, and the vertices T c which are 
connected to the sources S c within the strip. 

The remaining iterations in the strip combine the connected components until the strip 
is finished. Then the algorithm continues with the next strip in the same level of recursion, 
or, if it finished the last strip, it relocates to its starting position and returns to the next 
higher level of recursion. 

Theorem 6 RECURSIVE-STRIP runs in time 0(E + y 1+0 ( 1 )). 

A proof sketch included in the Appendix. □ 

6 Searching a graph for a treasure 

We now consider the problem of searching for a treasure in a potentially infinite graph 
G = (V,E). We give the procedure TREASURE-SEARCH, which uses the RECURSIVE-STRIP 
algorithm as a subroutine. If the treasure is at a location which is distance St away from 
the source vertex, this algorithm maintains the condition that the robot is never further 
from the source than A, where A < St + o(St)- Procedure TREASURE-SEARCH traverses 
0(E + y 1+0 ( 1 )) edges, where E and V are the total number of distinct edges and vertices 
within radius A from the source. 

The robot searches the graph for the treasure in phases. In each phase z, the robot calls 
RECURSIVE- STRIP to search a strip in the graph. The size of the strips changes over time. 
The change depends on ei, e 2 , e 3 , . . ., where e 8 - = 1/yi. Initially, the robot explores the graph 
out to distance r\ = 1 + ei. Next, the robot extends his search by a factor of 1 + e 2 . That is, 
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the size of the next strip is (l + ei)(l + e 2 ) — (1 + ei), and the robot now knows the graph 
out to distance r 2 = (1 + ei)(l + £2)- After extending the next strip, the robot knows the 
graph out to distance r 3 = (1 + ei)(l + e 2 )(l + e 3 ), and so on. 



Treasu re- Search (s) 




1 i = 




2 i-o = l 




3 Do until treasure is found: 




4 i = i+ 1 




5 €{ = 1/v* 




6 ri = r i _ 1 ■ (1 + e t ) 




7 let S be set of source vertices distance r,. 


_i away from s 


8 Recursive-Strip^, r t - r t _ u S) 





The correctness of this procedure can be shown by the following lemmas, which can be 
used to prove Theorem 3. 

Lemma 3 The number of phases of procedure TREASURE-SEARCH is at least log St and at 
most In St- 

Proof sketch: The number of phases is at least j where (1 + e^) 3 > St- The number of 
phases is at most m where (1 + e m ) m > St- We can show the lemma using these inequalities. 



Lemma 4 The robot is never further than St + St /\/log St away from the source. 

Proof sketch: Let A be the furthest distance the robot gets from the source vertex. In 
the TREASURE-SEARCH algorithm, the most that A and St differ is STe maX} where max is 
the number of phases that need to be explored to get out to depth St- Lemma 3 shows that 
the total number of strips explored is at least log<5x. Thus, e max is at most 1/ydogJx, and 
A < St + St /\/log St = St + o(St)- n 



7 Open problems 

We have presented an efficient 0(E + y 1+0 ( 1 )) algorithm for piecemeal learning of arbitrary, 
undirected graphs. The only lower bound known for this problem is the trivial linear bound 
ft(E + V). It is open whether there is a linear-time algorithm for piecemeal learning of 
general graphs. 

We have also given an algorithm for the application of treasure hunting on potentially 
infinite graphs that runs in time nearly linear in E and V, where E and V are the number of 
distinct edges and vertices within radius A from the start vertex. Is it possible (we conjecture 
not) to find a treasure in time nearly linear in the number of those vertices and edges whose 
distance to the source is less than or equal to that of the treasure? 
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Appendix 

We include in this Appendix some proof sketches of the theorems in the paper. 

Lemma 2 At most log V iterations per strip are needed to explore a strip and extend the 
global BFS tree by depth yfr . 

Proof sketch: If there are initially / source vertices, then after the hrst iteration there 
are at most / connected components. If a component does not collide with another active 
component, then it will have no active frontier vertices for the next iteration. Thus, each 
iteration halves the number of components with active frontier vertices. After at most log V 
iterations there is no connected component with active frontier vertices left. The robot then 
has a complete map of the current strip and of the global BFS tree built in previous strips, 
so he can combine this information and extend the global BFS tree by depth yfr . □ 



Theorem 5 ITERATIVE-STRIP-BFS runs in time 0((E + V^log V). 

Proof sketch: We hrst count the number of edge traversals per strip. Let Vi and Ei be the 
number of vertices and edges explored in strip i. For each component, vertices of distance t 
from some source vertex are expanded by computing a spanning tree of the component, 
doing a DFS of the spanning tree, and expanding all vertices of distance t from some source 
vertex (lines 8,9). At each iteration (line 6), components are disjoint, so relocating to all 
vertices in the strip of distance exactly t takes at most 0(V) edge traversals. Thus, in one 
iteration, relocating to all vertices in the strip within distance yfr takes at most 0(y/rVi) edge 
traversals. Moreover, note that in order for the robot to expand each vertex, he traverses 
at most O(Ei) edges. Thus, the total number of edge traversals for strip i is 0(E{ + y/rVi). 
Combining this with Lemma 2, and noting that r < V, proves the lemma. □ 

Theorem 6 RECURSIVE-STRIP-BFS runs in time 0(E + y 1+0 ( 1 )). 

Proof sketch: First we observe that each vertex is expanded at most once, so there are at 
most 0(E + V) edge traversals due to exploration at line 5 in the bottom level of recursion. 

We now count the edge traversals for relocations. For a particular level- i call of RECURSIVE- 
STRIP-BFS, let Ci denote the number of edge traversals due to relocations, and let Ei denote 
the number of distinct edges that are traversed due to relocation. Let Vi denote the number 
of vertices incident to these edges and whose incident edges are all known at the end of this 
call. Let pi be a uniform upper bound on C 8 /K'. Thus, if the depth of recursion is k then 
the total number of edge traversals is bounded by 0(Vpk)- 

Consider a level- i call. First we count the number of edge traversals for relocation between 
source vertices. Since all the source vertices in the call are connected by a tree of size 
0(Vi), relocating to all source vertices at the start of one strip takes O(K') edge traversals. 
With di/di-i strips and log V iterations per strip, there are Vi log V -r-*— edge traversals for 
relocations between source vertices. 

We now count traversals for recursive calls within the call. Note that our algorithm avoids 
re-exploring previously explored edges. Thus, for a level- i call, when working on a particular 
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strip /, for each iteration within this strip, the sets of vertices whose edges are explored in 
each recursive call are disjoint. Suppose that, in this strip, in one iteration the procedure 
makes k recursive calls, each at level i — 1. Then let C^, 1 < j < k, denote the number of 
edge traversals due to relocations resulting from the j-th recursive call, and let V^{ denote 
the number of vertices adjacent to these edges. Furthermore, let Vij denote the number of 
vertices which are in strip / of this procedure call at recursion level i. Then we would like hrst 
to calculate J2 1= i C 8 -i? which is the number of edge traversals due to relocation in recursive 

calls in one iteration within this strip. This is at most J2 J= i Pi-iK-i = Pi-i Sj=i K-i- Since 
the recursive calls are disjoint, J2 J= i K-i = M,i 5 an d thus the number of edge traversals due 
to relocations in recursive calls in one iteration within this strip is at most pi_iVij. Finally, 
since there are log V iterations in each strip, and all strips are disjoint from each other, the 
number of edge traversals due to recursive calls is at most pi-\Vi log V. 

Finally, note that we relocate once at the end of each procedure call of RECURSIVE- 
STRIP-BFS (see line 19). This results in at most Vi edge traversals. 

Thus, the number of edge traversals due to relocation is described by the recurrence 
Ci < V log V -r 1 — h Pi-iVi log V + Vi. Normalizing, we get the following recurrence: 

P l = (-^- + P i-i)\ogV + 0(l) 



di 



Solving the recurrence for p\. gives: 



P k < iogy(-^Uiog 2 yf^iU... + iog fc yf^Uiog fc yco + y:iog 8 V 




k-2/ \a / 8=0 



d, 

< iogy(-^)+iog^(^)+... + iog fc y(5i)+o(iog fc y) 




dk-2. 

We note that p = 0(1), since at the bottom level, if there are V vertices expanded, then the 
number of edge traversals due to relocation is 0(V). Furthermore, notice that the product 
of the k terms in the recurrence is dk{\ogVp k+1 ' k ' 2 = V(\ogVp k+1 ' k ' 2 (in the worst case), 
and that the sum of these terms is minimized by setting each of theses terms to the k-th. root 
of the product. (Note that this also specifies how to calculate depth &i-\ from depth di.) 

Minimizing, we get: p k < &r 1/,fc (log y)( fc+1 V 2 + 0(log fc V). Choosing k = ( ^ ^ V v ) gives us 

pk = 2 0(^lo g V\o g lo g V)^ and thug Ck ig at mogt V2 0(^/lo g Vlo g lo g V)^ which ig yl+ (l)_ Adding 

the edge traversals for relocation to the edge traversals for exploration gives us 0(E-\-V 1+0 ^ 1 ') 
edge traversals total. □ 
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